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We propose a construction of frequentist confidence intervals that is ef- 
fective near unphysical regions and unifies the treatment of two-sided and 
upper limit intervals. It is rigorous, has coverage, is computationally sim- 
pie and avoids the pathologies that affect the Likelihood Ratio and related 



X 



constructions. Away from non-physical regions, the results are exactly the 



usual central two-sided intervals. The construction is based on including the 
physical constraint in the derivation of the estimator, leading to an estimator 
with values that are confined to the physical domain. 
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I. INTRODUCTION 



Obtaining confidence intervals near physical boundaries is a long-standing problem. Ex- 
periments designed to detect a non-zero neutrino mass by observing neutrino oscillation or 
to detect a small resonance signal in the presence of background are examples in which a 
negative result may be obtained for a quantity that is intrinsically positive. The difficulty 
arises when the estimate for the Gaussian or Poisson mean, as obtained from the data, 
is near or beyond the physical boundary, in which case the standard (classical) result of 
Neyman's construction is an unphysical or null interval as illustrated in Fig.s |l| and ^. 

For the Gaussian case, Fig. [E[ one obtains central confidence intervals for the mean \i 
constrained to be non-negative, using the sample mean x as the estimator for [i. x sufficiently 
negative leads to the null interval. Despite the fact that the construction has coverage a, 
which means that, for any given true mean, the confidence interval includes that value with 
probability a, the null interval cannot contain a true non-zero mean. It is necessarily one 
of the measured intervals that, with probability 1 — a, fail to contain the true mean. Even 
the non-null intervals obtained by this method for some negative values of the estimator are 
unphysically small in that, for most possible (true) means, the confidence interval does not 
contain the true mean. 

The other difficult case, illustrated in Fig. |2|, is that of Poisson distributed data with 
unknown signal mean /i > 0, in the presence of a background with known mean b; n is 
the result of a single observation. For n < b the interval for /i is unphysically small. For 
sufficiently small n the interval is null. The implausibility of the resulting intervals is well 
illustrated by the example shown. For a background-free (b = 0) experiment that measures 
zero events(n = 0), the 90% upper limit for /i is 2.62, for the explicit construction exhibited 
in Fig. |] 0. For an experiment with known mean background b = 3.0 that measures 0(1) 
events, the upper limit for \x is 0(1.7). Thus the poorer experiment has the potential to yield 
a much smaller (but not believable) upper limit. 



When the estimator takes on a value near or beyond the physical limit, we have informa- 
tion greater than that available when no boundary is present since we know a priori that the 
true value is not beyond the boundary. For the Gaussian case, where the confidence intervals 
are of fixed length for measurements away from the boundary, we expect smaller confidence 
intervals for measurements near or beyond the boundary. The classical construction gives 
this feature. We also know that an estimate for the parameter beyond the physical limit 
is relatively improbable. The flaw in the standard classical method is that increasingly im- 
probable estimates lead to increasingly small and ultimately null confidence intervals. One 
cannot accurately estimate a parameter by making an extremely improbable observation. 
The best result for the determination of a parameter should follow from the most probable 
measurement and, arguably, the smallest confidence interval should be obtained for that 
observation, i.e. x = \x for the Gaussian case and n = b + \x for the Poisson case. 

II. PREVIOUSLY SUGGESTED METHODS FOR OBTAINING IMPROVED 

CONFIDENCE INTERVALS 



A number of suggestions have been made for estimating believable confidence intervals 
for bounded parameters. In the Review of Particle Properties [@||, the Particle Data Group 
suggests several options for revising the intervals described above to make them conservative, 
leading to overcoverage for small true values, and also discusses the use of "Bayesian upper 
limit (s), which must necessarily contain subjective feelings about the possible values of the 
parameter" . 

Recently, several authors have suggested the use of different selection principles for the 
construction of intervals. In the Neyman construction, the confidence belt depends both 
on the properties of the estimator and a selection principle. The Neyman construction can 
be simply described by means of a plot containing values of the estimator on the abscissa 
and values of the parameter on the ordinate. According to some prescription, i.e. the 



selection principle, one selects, for any given value of the parameter, a horizontal interval 
corresponding to a designated probability (the coverage) as determined by the sampling 
distribution of the estimator. The region mapped out in this way for all values of the 
parameter constitutes the confidence belt. After an experiment is performed, yielding a 
specific value for the estimator, the corresponding confidence interval for the parameter, 
with the designated coverage, is the vertical interval contained in the confidence belt at that 
value of the estimator. The most commonly used selection principles (for coverage a) are 
central (probability a within the belt and equal probabilities on either side) and one sided 
(0 lower limit and thus probability a to the left of x upper ). One has the freedom to depart 
from the usual selection principles by, for example, invoking a selection which makes the 
confidence belt as narrow as possible [|J. 

In recently suggested modifications, Ref. || addresses both the Gaussian and Poisson 
cases while Ref. deals only with the Poisson case. These approaches employ ordering 
principles for the selection, i.e. rules which order the outcome probabilities before aggregat- 
ing to give total probability a for each value of the parameter. In particular, the ordering 
is based on the Likelihood Ratio Construction j7] (and a variant), where the physical con- 
straint on the parameter space is used in the computation. These constructions produce 
finite confidence intervals for all values of the classical estimator and also achieve the ad- 
mirable unifying feature that one need not decide beforehand whether to set a confidence 
interval or a confidence bound. However, the intervals obtained are small for improbable 
values of the estimators and share with the classical central construction the difficulty that, 
for a quite improbable value, the confidence interval approaches the null interval. Thus, for 
the Gaussian case, a very negative measured value yields a very small confidence interval 
with lower limit zero. Table X of Ref. || gives the confidence interval for the (non-negative) 
Gaussian mean \i for measured value Xq. For measured value —3.0 (unit variance assumed), 
the 68.27% confidence interval is [0.00, 0.04]. Despite the fact that this construction has 
68.27% coverage, the confidence interval derived from this measurement does not contain 
the true value for most possible true values of the Gaussian mean (excepting those in [0.00, 



0.04]) that can lead to the measurement. The resulting confidence interval is unphysically 
small. It does not imply, in the words of the authors, a high "degree of belief that the true 
value is within the interval. Our construction, which is described below, yields [0, 1.0]. 

For the Poisson example cited above, of an experiment with known mean background b 
of 3.0 and a single observation yielding n = 0, the 90% interval for the signal /i given by 
Ref.s H and || are [0, 1.08] and [0, 1.86] respectively, smaller than the interval given for 
n = 0, b = of [0, 2.44]. Ref. || emphasizes that the reason for obtaining small upper limits 
for n < b is not increased sensitivity to the signal but just that fewer background events 
than expected are observed, and views it as "an undesirable feature from the physical point 
of view" for the upper limit to decrease as b increases. Our construction, described below, 
yields [0, 2.62] for the 6 = case and [0, 4.69] for the b = 3.0 case, thus a larger rather than 
smaller interval for events measured when background is present. Of the constructions 
discussed here, ours is the only one where the upper limit increases rather than decreases as 
b increases for fixed n. 

In recognition of the problem of unphysical intervals, Ref. || introduces the concept of 
"sensitivity" to handle cases in which the measurement is less than the estimated background 
and the confidence interval is suspect. This, however, requires quoting a second value, a 
characteristic of the experiment itself, in addition to the interval quoted. No substitute 
interval is offered. 

The authors of Ref. || construct confidence intervals for the Poisson case. They point 
out that the observation n = implies that zero signal is seen, thus the estimate for /i 
(zero) is independent of b. They argue, therefore, that the confidence interval for /i for n=0 
must be independent of b. Extending the argument, they note that for any observation n, 
one has observed a signal n from the Poisson pdf p(n; jjl + b) and at most a background n. 
Thus they formulate a method of obtaining confidence intervals based on the conditional 
probability to observe n given a background < n and obtain the desired result for n = 
and approximately the classical confidence intervals for n > b. While they identify their 
method as an ordering principle, it is not one in the same sense as Ref.s f| and [fj. The 



confidence belt is not constructed from the sampling distribution of an estimator and hence 
does not have coverage in the usual sense. The method gives intervals that are intuitively 
more satisfying as measures of confidence. However, because the method does not provide 
coverage, one cannot precisely state the probability that the interval encloses the true value. 
Although the intervals determined by the method of || do not have coverage, they can 
be easily modified so that they do, by restructuring the confidence belt, retaining the lower 
limit and adjusting the upper limit so that all horizontal intervals contain probability a. 
If one thus modifies the construction, the procedure represents another selection principle 
applied to the Poisson pdf for the sample mean. For n = independent of b, this method 
gives a 90% upper limit of 2.42. 

III. FREQUENTIST VS BAYESIAN CONFIDENCE INTERVALS 

The methods of Refs. || and || are frequentist, as they are constructed from the sampling 
distribution of an estimator, in this case the sample mean, and have coverage by construction. 
However any estimator may be chosen for the Neyman construction. The method used to 
choose the estimator is arbitrary. The estimator may be a guess, or arrived at by the 
usual techniques of moments or the Maximum Likelihood Method. Although it is in general 



desirable for an estimator to be sufficient and unbiased |12[ , it need not have these properties, 
so long as it possesses other desirable features, e.g. gives an appropriate point estimate of 
the parameter of interest and leads to confidence intervals that are restrictive and believable 
from a physical point of view. Coverage is guaranteed by construction. 

Bayesian confidence intervals are constructed from the Bayesian posterior density, which 
is interpreted as the probability density for the unknown parameter. A selection principle is 
again needed to specify the parameter interval containing the designated probability. The 
Bayesian procedure for confidence intervals does not guarantee coverage because it is not 
obtained from the probability density of a statistic or random variable and can be criticized 
for the subjectiveness inherent in establishing the required Bayesian prior. For a discussion 



of Bayesian methods, the reader is referred to Ref. 0. Our interest is in a frequentist 
method, as described in the following section. 

IV. INTERVALS BASED ON AN ESTIMATOR DERIVED FROM A 
LIKELIHOOD FUNCTION THAT CONTAINS THE PHYSICAL CONSTRAINTS 



The authors cited above have focused on modifying the selection principle to make the 
confidence intervals more believable. However the reason that their constructions lead to 
unphysically small confidence intervals near the boundary of a physical region is that the 
method used to obtain the estimator does not take into account the physical constraint on 
the parameter of interest and the resulting estimator is thus the same as if there were no 
boundary. Even though that estimator is efficient, it is appropriate for a problem other than 
the one under consideration. 

We propose a frequentist method and use the Maximum Likelihood Method to derive the 

estimator employed. Among methods for determining estimators, the Maximum Likelihood 

Method is preferred in that if a consistent estimator exists, the method will produce it 

lTi| |12| . The Likelihood Function chosen explicitly contains the physical constraint and 



leads to an estimator with values within the physical domain that is appropriate for the 
problem. The confidence intervals obtained consequently from the sampling distribution of 
the estimator have coverage by construction, are more physical and support a higher degree 
of belief that the parameter of interest lies within the interval. 

This method follows classical estimation theory; the only new element is that the Like- 
lihood Function explicitly excludes non-physical true values. The determination of the esti- 
mator, its sampling distribution and the confidence intervals follow directly without further 
assumptions. We emphasize that the procedure we are following is not Bayesian and that 
the exclusion of non-physical true values is not equivalent to a uniform Bayesian prior for 
the physical region any more than the usual unconstrained Likelihood Function is viewed as 



containing a uniform Bayesian prior for the entire domain. 



A. Gaussian variates 



We assume that x is normally distributed with no n- negative mean /x and variance a 2 . 
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The likelihood function, when there are N measurements x±,X2, ■■■■x n , is: 
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where 9(/j) is a step function; #(/x) = for \x < 0, #(/x) = 1 for /x > 0. The estimator for 
/x, which we denote by fj,*, is the function of the measurements, /x(xj), that maximizes w. 
Since w = — oo for /i < 0, /x*must be > 0. We set 

dw x 



ci/x 
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Xi- \l 







er 



(4) 



For the sample mean x = ^ Dili ^i > 0, /x* = x. For x < 0, ^ < for all /x > 0, so 
the maximum of w is at xx* = 0. x has a normal distribution with mean /x and variance 
(T 2 ^ = cr 2 /N. The probability density function for /x* is normal with the usual normalization 
for /x* > and a delta function at /x* = normalized to the remaining probability 
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Thus the probability density function for /x* is given by: 



P(/x» 
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(6) 



The mean and variance of xx* are given by: 
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E(fi*) approaches /i and V(/j,*) approaches a% for N large. For finite N, E(fi*) does not 
equal /x, so fi* is a consistent but not unbiased estimator for \i. It is, however, asymptoti- 



cally unbiased. From Estimation Theory [10-12H we know that If the Likelihood Equation 
has a solution //* which is a consistent estimator of fi, then /j,* is asymptotically normally 
distributed with a mean of fi and a variance of [—NE(d 2 lnf(x\fj,)/dfi 2 )] . V(/x*) equals 
0.340"^ at /i=0, monotonically increasing to a 2 ^ at large \x. For finite N, V(/x*) is smaller 
than o~ 2 N . 



Nevertheless, V(/x*) satisfies the usual Cramer- Rao inequality [12 



vw > 



( dE(^ 2 
\ dfi 



IX 



(9) 



where Ix is the Fisher Information, the usual measure of the information contained in the 

1 — P and Ix — -7T and by explicit calculation one can 

show that 



measurements. One finds — 4^- 

dfj, 



V{if) > (1 - P )V 



2„2 

N 



a 



N 



(10) 
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We note that //* does not satisfy the criteria for sufficiency However for the purpose 
of supplying a point or interval estimate for this special case where there is a boundary, it 
contains all of the necessary information. (For x < 0, the best estimate of \x is zero.) We 
demonstrate the construction of the 68.27% central confidence belt, in units of o~n = cr/y/N, 
in Fig. |3|. We invoke the Neyman construction and select, for any given value of //, the 
"central" interval of /x* that contains 68.27% of the /1* sampling distribution. For /i=0, 50% 
of the jjl* probability distribution is associated with /i*=0. The remaining 18.27% of the 
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68.27% belt is contained in the /i* interval between and what we call <5 M . A straightforward 
calculation gives er/(5 M /v / 2) = 2 x 0.1827, or ^=0.475. 

As jjl increases from to 1, the upper endpoint of the 68.27% interval rises linearly with 
unit slope. For /x > 1, the central 68.27% interval in /i* is /i — 1 < /i* < fx + 1. It is the 
requirement of exactly 68.27% coverage, and the fact that the finite probability associated 
with ii* = must be taken into account, that introduces a discontinuity in the central 
interval at /i = 1. 

Once the confidence belt is constructed, as in Fig. |3|, it follows from the Neyman method 
that confidence intervals of ft with corresponding coverage can be read off as vertical intervals 
of the belt for any measured x. We need only keep in mind that all x < correspond to 
//* = 0. 

In our formulation, the necessary "lift up" 0] of the estimate from an unphysical to a 
physical value and/or the raising of an upper bound to a non-null value comes naturally 
from the estimator derived from the Likelihood Function. In other approaches, |2],[5| the 
"lift-up" is obtained somewhat arbitrarily by ad hoc procedures or by specifying an ordering 
principle. The latter methods do not solve the problem that, in the words of Ref. |2[],"in 
some (rare) cases it is necessary to quote an interval known to be wrong." 

B. Poisson variates with background 



We consider n to be a single Poisson distributed variate with non-negative signal mean 
/i and known mean background b. Let p(n\m) = m n e~ m /n\ denote the Poisson probability 
for obtaining the measurement n when the mean is m. Then 

f{n\n) =p{n\n + b) (11) 

L(Ai) = finWin) (12) 

w(/x) = InL(fi) = nln(fi + b) — (fi + b) — ln{n\) + ln0{jj) (13) 
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(X* is the function of n that maximizes L and is thus the estimator for /i. For n > b, 
fi* = n — b. For n < b, ft* = 0. Thus the estimator for fi is non-negative. The probability 
of fi* for a given /i is P(fi*\fi, 6) = p(fi* + b\fi + b) for /i* > and a value at fi* = given by 
Sn<bP( n l/ i + &)■ Rather than work with the estimator fi*, it is more convenient to define an 
integer estimator, n*, such that n* = for n < b and n* = n — b~ for n > b, where b~ is the 
largest integer less than or equal to b. Thus n* = fi* + {b — b~). 

We demonstrate the construction of the 90% confidence belt by means of an example, 
shown in Fig. ^, where the known mean background b is equal to 2.8. b is chosen non- 
integer to illustrate this slightly more complicated case. We also show the confidence belt 
consisting of central intervals [ni(fio), n 2 (/io)] |@ containing at least 90% of the probability 
for unknown Poisson mean fi in the absence of any known background (dotted) and the 
90% one-sided belt consisting of intervals [0, n os (/j, )] (dashed). Our 90% confidence belt is 
defined only for /i > and fi* > 0. We define a coordinate system (n*,//) by placing the 
ordinate fi = at fi = b and choosing the integer abscissa value n* = to coincide with 
n = b~ . 

Let fj,' be the largest value of /i such that [ni(/i ),n 2 (/io)] contains b~. (In the example 
given, /j,' = 6.2, corresponding to a value of /i = 6.2 — 2.8 = 3.4, and n os (fi' ) = 9.) For 
< \x < n' Q — b (i.e. b < jiq < fi' ), the 90% horizontal interval is [b~ ,n os (fi )}. For \x > fi f — b 
(i.e. /i > /^o), the 90% horizontal interval is [ni(fjLo),n2(fJ>o)]- The resulting confidence belt 
is shown in solid lines. The set of joined horizontal and vertical line segments is simple 
and continuous and no compensatory remedies are required. To obtain the 90% confidence 
intervals for /i, given a measurement n, we need simply find the appropriate vertical interval 
from the plot. By the Neyman construction, it has > 90% coverage. 

Let [ci(m), 02(171)] denote the usual (i.e. in the absence of known background) Poisson 
90% confidence interval for the mean, /xq, for m observed events (the dotted horizontal lines 
in Fig. f|) . Also, let c os {m) denote the usual 90% one-sided lower limit for m observed 
events (the dashed horizontal lines in Fig. |]). Then for n < b, n* = and we obtain the 
upper limit for /i of C2(b~) — b. For b < n < n os (b) we obtain the upper limit 02(1%) — b; 
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for n os (b) < n < n os (n' ) we obtain the interval [c os {n) — b,C2(n) — b}; for n = n os (fj,' ) + 1 
we obtain the interval [fi' — b, 02(71) — b] and for n > n os (n' ) + 1 we obtain the interval 
[ci(n) — b,C2(n) — b]. We note that any Poisson interval with known background can be 
obtained from a single figure or table. 

It is straightforward to generalize to the case of N independent measurements. For 
measured mean n > b, //* = n — b. For n < b, fi* = 0. The probability for \j* is Poisson for 
11* > plus a value at \x* — normalized to the remaining probability. 

P<ji*\li, b),» >0 = v {N{n* + b)\N(jt + b)) (14) 

P(0\fi,b)= Yl p{m\N(fM + b)) (15) 

m<Nb 

In this case we can find the confidence interval for //* by relabeling the axes in Fig. |] as 
follows: n — > Nn, fio — > NfiQ, n* — > iVn*, \x -^ Nfi, and the origin of the inner coordinate 
system is (Nb~,Nb). 

V. MASS SQUARED OF THE ELECTRON NEUTRINO 

As an example we obtain the 68.27% confidence interval for the mass squared of the 
electron neutrino, disregarding the possibility that the source of negative measurements is 
physics (fitting to the wrong function) rather than statistical variation. Using the mea- 
surement quoting the smallest error, that of Ref. [14| giving —22 ± 4.8 eV 2 , and assuming 
Gaussian probability we obtain the interval [0, 4.8]. The classical Neyman interval is null 
and the interval offered by Ref. § is [0, 0.02] @. 

VI. CONCLUSION 
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We have demonstrated a rigorous method for obtaining frequentist confidence intervals 
that incorporates the physical constraints of the problem into the Likelihood Function, thus 
yielding an estimator that is suitable to the presence of physical boundaries. Using a central 
ordering principle, we obtain either upper limits or central intervals with a smooth transition. 
The intervals are physical in that they support a high degree of belief that the true value is 
within the interval, avoiding the pathologies of null or unphysically small intervals and the 
consequent possibility of obtaining a better result (smaller confidence interval) for a worse 
experiment. The construction is not equivalent to the Likelihood Ratio Construction which 
does not give satisfactory intervals near unphysical regions. 
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FIG. 1. Confidence belt, in the usual construction, giving 68.27% central confidence intervals 
for the unknown mean of a Gaussian with variance a 2 , in units of ajy = a/y^N), where x is the 
sample mean of N measurements. 
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FIG. 2. The classical construction of the 90% central confidence belt (solid) for unknown 
non-negative Poisson signal [i in the presence of a Poisson background with known mean b taken to 
be 3.0, where n is the result of a single observation. Here hq = jjl + b is the parameter representing 
the mean of signal plus background. For n = the confidence interval is null. 
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FIG. 3. Confidence belt, in our construction, giving 68.27% central confidence intervals for the 
unknown mean of a Gaussian with variance a 2 , in units of <jn = cr/yN, where x is the sample 
mean of TV measurements. For x < 0, /i* = and the interval is [0, 1]. For < x < 5^ the interval 
is [0, x + 1]. For 6/j, < x < 1 + <5 M it is [x — Sfj,, x + 1]. For 1 + 5^ < x < 2 the interval is [1, x + 1] 
and for x > 2 we obtain the usual central interval [x — 1, x + 1]. 5 M = 0.475. 
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FIG. 4. The 90% central confidence belt (solid) for unknown non-negative Poisson signal [i 
in the presence of a Poisson background with known mean b taken to be 2.8, where n is the 
result of a single observation. We show the confidence belt consisting of central intervals [ni(^o)j 
^2(^0)] containing at least 90% of the probability for unknown Poisson mean /zo in the absence 
of background (dotted) and the 90% one-sided belt consisting of intervals [0, n os (//o)] (dashed). 
For no < 2.62, only one-sided intervals can be constructed. For b = 2.8, b~ = 2, /i' = 6.2, 
n os(b) = 5, and n os (fi' Q ) = 9 (see text for definitions). For n < b, the confidence interval for /j, 
is [0,C2(b~) — b = 3.4] and the examples given are for n < 2. For b < n < 5, the interval is 
[0, C2(n) — b] where the example given is for n = 4 and the interval is [0, 5.8]. For 5 < n < 9, the 
interval is [c os (n) — b, C2(n) — b] where the example given is for n=7 and the interval is [1.1, 9.7]; for 
n = 10, the interval is [n' — b, C2(n) — b]; and for n > 10, the interval is [ci(n) — 6, C2(n) — b]. Here 
[ci(m), C2{m)\ is the Poisson central 90% confidence interval and c os {m) is the one-sided Poisson 
Vo lower limit, both for a single observation giving m in the absence of any known background. 
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